Downloads
You can download metadata in long or wide format and taxonomic profiles. This database is made available under the Open Database License. This means that you can build upon this database, but have to attribute this website and share-alike according to the terms of the license. Any rights in individual contents of the database are licensed under the Database Contents License.
The majority of samples in Metalog are available at ENA/SRA/DDBJ, although some are from MG-RAST and CNSA. Mappings of Metalog identifiers to samples, runs, and experiments are available in this mapping file. An external sample accession can actually correspond to several biological samples, and vice versa sometimes sequences belonging to the same actual sample have been submitted under multiple sample accessions. For this reason, we use our own sample identifiers throughout Metalog.
We have prepared an R script as an usage example that downloads data from Metalog and looks for associations between bacteria and medication in the gut microbiome of adults.
Metadata in long format
Metadata in long format with one row per sample and metadata item.
Environmental
Metadata in wide format
Metadata in wide (matrix-like) format with one row per sample. As this contains many partially empty columns, we do not generate the download file containing all study-specific metadata fields in this format.
Human
Animal
Ocean
Environmental
Taxonomic Profiles
Taxonomic profiles can be downloaded for these profilers: mOTUs 3.0, MetaPhlAn 4.0 (database version: vJun23_202307; mapping between clade names and full lineages and NCBI Taxonomy identifiers), and mOTUs based on the species in the SPIRE database. For adult human fecal samples, enterotype predictions (including the Enterotype Dysbiosis Score) are available. We will also add the predicted fecal microbial load as soon as possible.
Taxonomic profiles may be unavailable for a number of reasons, e.g. missing or empty sequencing data, but also delays in processing after metadata curation. We are aware that some studies are missing a large number of taxonomic profiles and are working on adding these. Known to be missing are mOTUs 3 profiles for:
- Human: 32779 samples across 181 studies ( list of missing samples)
- Animal: 1261 samples across 45 studies ( list of missing samples)
- Ocean: 563 samples across 25 studies ( list of missing samples)
- Environmental: 8778 samples across 127 studies ( list of missing samples)